
Creators/Authors contains: "Larracuente, Amanda"


  1. Bateman, J (Ed.)
    Abstract

    Large genome structural variations can impact genome regulation and integrity. Repeat-rich regions like pericentric heterochromatin are vulnerable to structural rearrangements although we know little about how often these rearrangements occur over evolutionary time. Repetitive genome regions are particularly difficult to study with genomic approaches, as they are missing from most genome assemblies. However, cytogenetic approaches offer a direct way to detect large rearrangements involving pericentric heterochromatin. Here, we use a cytogenetic approach to reveal large structural rearrangements associated with the X pericentromeric region of Drosophila simulans. These rearrangements involve large blocks of satellite DNA—the 500-bp and Rsp-like satellites—which colocalize in the X pericentromeric heterochromatin. We find that this region is polymorphic not only among different strains, but between isolates of the same strain from different labs, and even within individual isolates. On the one hand, our observations raise questions regarding the potential impact of such variation at the phenotypic level and our ability to control for such genetic variability. On the other hand, this highlights the very rapid turnover of the pericentric heterochromatin most likely associated with genomic instability of the X pericentromere. It represents a unique opportunity to study the dynamics of pericentric heterochromatin, the evolution of associated satellites on a very short time scale, and to better understand how structural variation arises.

     
    Free, publicly-accessible full text available September 28, 2024
  2. Repetitive elements (REs) are integral to the composition, structure, and function of eukaryotic genomes, yet remain understudied in most taxonomic groups. We investigated REs across 601 insect species and report wide variation in RE dynamics across groups. Analysis of associations between REs and protein-coding genes revealed dynamic evolution at the interface between REs and coding regions across insects, including notably elevated RE–gene associations in lineages with abundant long interspersed nuclear elements (LINEs). We leveraged this large, empirical data set to quantify impacts of long-read technology on RE detection and investigate fundamental challenges to RE annotation in diverse groups. In long-read assemblies, we detected ∼36% more REs than short-read assemblies, with long terminal repeats (LTRs) showing 162% increased detection, whereas DNA transposons and LINEs showed less respective technology-related bias. In most insect lineages, 25%–85% of repetitive sequences were “unclassified” following automated annotation, compared with only ∼13% in Drosophila species. Although the diversity of available insect genomes has rapidly expanded, we show the rate of community contributions to RE databases has not kept pace, preventing efficient annotation and high-resolution study of REs in most groups. We highlight the tremendous opportunity and need for the biodiversity genomics field to embrace REs and suggest collective steps for making progress toward this goal.

     
    Free, publicly-accessible full text available October 1, 2024
  3. Barbash, Daniel A (Ed.)
    Advances in genomic technology led to a more focused pattern for the distribution of chromosomal proteins and a better understanding of their functions. The recent development of the CUT&RUN technique marks one of the important such advances. Here we develop a modified CUT&RUN technique that we termed nanoCUT&RUN, in which a high affinity nanobody to GFP is used to bring micrococcal nuclease to the binding sites of GFP-tagged chromatin proteins. Subsequent activation of the nuclease cleaves the chromatin, and sequencing of released DNA identifies binding sites. We show that nanoCUT&RUN efficiently produces high quality data for the TRL transcription factor in Drosophila embryos, and distinguishes binding sites specific between two TRL isoforms. We further show that nanoCUT&RUN dissects the distributions of the HipHop and HOAP telomere capping proteins, and uncovers unexpected binding of telomeric proteins at centromeres. nanoCUT&RUN can be readily applied to any system in which a chromatin protein of interest, or its isoforms, carries the GFP tag. 
  4. Y chromosomes across diverse species convergently evolve a gene-poor, heterochromatic organization enriched for duplicated genes, LTR retrotransposons, and satellite DNA. Sexual antagonism and a loss of recombination play major roles in the degeneration of young Y chromosomes. However, the processes shaping the evolution of mature, already degenerated Y chromosomes are less well-understood. Because Y chromosomes evolve rapidly, comparisons between closely related species are particularly useful. We generated de novo long-read assemblies complemented with cytological validation to reveal Y chromosome organization in three closely related species of the Drosophila simulans complex, which diverged only 250,000 years ago and share >98% sequence identity. We find these Y chromosomes are divergent in their organization and repetitive DNA composition and discover new Y-linked gene families whose evolution is driven by both positive selection and gene conversion. These Y chromosomes are also enriched for large deletions, suggesting that the repair of double-strand breaks on Y chromosomes may be biased toward microhomology-mediated end joining over canonical non-homologous end-joining. We propose that this repair mechanism contributes to the convergent evolution of Y chromosome organization across organisms. 
  5. Hoffmann, Federico (Ed.)
    Abstract The first insect genome assembly (Drosophila melanogaster) was published two decades ago. Today, nuclear genome assemblies are available for a staggering 601 insect species representing 20 orders. In this study, we analyzed the most-contiguous assembly for each species and provide a “state-of-the-field” perspective, emphasizing taxonomic representation, assembly quality, gene completeness, and sequencing technologies. Relative to species richness, genomic efforts have been biased toward four orders (Diptera, Hymenoptera, Collembola, and Phasmatodea), Coleoptera are underrepresented, and 11 orders still lack a publicly available genome assembly. The average insect genome assembly is 439.2 Mb in length with 87.5% of single-copy benchmarking genes intact. Most notable has been the impact of long-read sequencing; assemblies that incorporate long reads are ∼48× more contiguous than those that do not. We offer four recommendations as we collectively continue building insect genome resources: 1) seek better integration between independent research groups and consortia, 2) balance future sampling between filling taxonomic gaps and generating data for targeted questions, 3) take advantage of long-read sequencing technologies, and 4) expand and improve gene annotations. 
  6. The rapid evolution of repetitive DNA sequences, including satellite DNA, tandem duplications, and transposable elements, underlies phenotypic evolution and contributes to hybrid incompatibilities between species. However, repetitive genomic regions are fragmented and misassembled in most contemporary genome assemblies. We generated highly contiguous de novo reference genomes for the Drosophila simulans species complex (D. simulans, D. mauritiana, and D. sechellia), which speciated ∼250,000 yr ago. Our assemblies are comparable in contiguity and accuracy to the current D. melanogaster genome, allowing us to directly compare repetitive sequences between these four species. We find that at least 15% of the D. simulans complex species genomes fail to align uniquely to D. melanogaster owing to structural divergence—twice the number of single-nucleotide substitutions. We also find rapid turnover of satellite DNA and extensive structural divergence in heterochromatic regions, whereas the euchromatic gene content is mostly conserved. Despite the overall preservation of gene synteny, euchromatin in each species has been shaped by clade- and species-specific inversions, transposable elements, expansions and contractions of satellite and tRNA tandem arrays, and gene duplications. We also find rapid divergence among Y-linked genes, including copy number variation and recent gene duplications from autosomes. Our assemblies provide a valuable resource for studying genome evolution and its consequences for phenotypic evolution in these genetic model species. 
  7. Abstract Seminal fluid proteins (SFPs) mediate an array of postmating reproductive processes that influence fertilization and fertility. As such, it is widely held that SFPs may contribute to postmating, prezygotic reproductive barriers between closely related taxa. We investigated seminal fluid (SF) diversification in a recently diverged passerine species pair (Passer domesticus and Passer hispaniolensis) using a combination of proteomic and comparative evolutionary genomic approaches. First, we characterized and compared the SF proteome of the two species, revealing consistencies with known aspects of SFP biology and function in other taxa, including the presence and diversification of proteins involved in immunity and sperm maturation. Second, using whole-genome resequencing data, we assessed patterns of genomic differentiation between house and Spanish sparrows. These analyses detected divergent selection on immunity-related SF genes and positive selective sweeps in regions containing a number of SF genes that also exhibited protein abundance diversification between species. Finally, we analyzed the molecular evolution of SFPs across 11 passerine species and found a significantly higher rate of positive selection in SFPs compared with the rest of the genome, as well as significant enrichments for functional pathways related to immunity in the set of positively selected SF genes. Our results suggest that selection on immunity pathways is an important determinant of passerine SF composition and evolution. Assessing the role of immunity genes in speciation in other recently diverged taxa should be prioritized given the potential role for immunity-related proteins in reproductive incompatibilities in Passer sparrows. 
  8. Abstract

    Study of repetitive DNA elements in model organisms highlights the role of repetitive elements (REs) in many processes that drive genome evolution and phenotypic change. Because REs are much more dynamic than single‐copy DNA, repetitive sequences can reveal signals of evolutionary history over short time scales that may not be evident in sequences from slower‐evolving genomic regions. Many tools for studying REs are directed toward organisms with existing genomic resources, including genome assemblies and repeat libraries. However, signals in repeat variation may prove especially valuable in disentangling evolutionary histories in diverse non‐model groups, for which genomic resources are limited. Here, we introduce RepeatProfiler, a tool for generating, visualizing, and comparing repetitive element DNA profiles from low‐coverage, short‐read sequence data. RepeatProfiler automates the generation and visualization of RE coverage depth profiles (RE profiles) and allows for statistical comparison of profile shape across samples. In addition, RepeatProfiler facilitates comparison of profiles by extracting signal from sequence variants across profiles which can then be analysed as molecular morphological characters using phylogenetic analysis. We validate RepeatProfiler with data sets from ground beetles (Bembidion), flies (Drosophila), and tomatoes (Solanum). We highlight the potential of RE profiles as a high‐resolution data source for studies in species delimitation, comparative genomics, and repeat biology.

     
  9. Abstract A long-standing enigma concerns the geographic and ecological origins of the intensively studied vinegar fly, Drosophila melanogaster. This globally distributed human commensal is thought to originate from sub-Saharan Africa, yet until recently, it had never been reported from undisturbed wilderness environments that could reflect its precommensal niche. Here, we document the collection of 288 D. melanogaster individuals from multiple African wilderness areas in Zambia, Zimbabwe, and Namibia. The presence of D. melanogaster in these remote woodland environments is consistent with an ancestral range in southern-central Africa, as opposed to equatorial regions. After sequencing the genomes of 17 wilderness-collected flies from Kafue National Park in Zambia, we found reduced genetic diversity relative to town populations, elevated chromosomal inversion frequencies, and strong differences at specific genes including known insecticide targets. Combining these genomes with existing data, we probed the history of this species’ geographic expansion. Demographic estimates indicated that expansion from southern-central Africa began ∼10,000 years ago, with a Saharan crossing soon after, but expansion from the Middle East into Europe did not begin until roughly 1,400 years ago. This improved model of demographic history will provide an important resource for future evolutionary and genomic studies of this key model organism. Our findings add context to the history of D. melanogaster, while opening the door for future studies on the biological basis of adaptation to human environments. 